SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS 
FOR CONSTRUCTING SAMPLING PLANS FOR ITEMS 
THAT ARE MANUFACTURED 



Related Applications 

This application is a divisional of pending Application No. 09/397,357 filed 
on September 15, 1999. 

Field of the Invention 

5 This invention relates to data processing systems, methods and computer 

program products, and more particularly to statistical systems, methods and computer 
program products. 

Background of the Invention 

10 Sampling plans are widely used in manufacturing environments in order to 

determine whether items are being manufactured at a desired quality level In order to 
construct a sampling plan, the ANSI/ASQ Zl .9 standard generally is used. The 
ANSI/ASQ Z1.9 standard is a collection of sampling plans presented in tabular and 
graphical form. In order to construct a sampling plan, the ANSI/ASQ Z1.9 standard 

15 is studied, and a sampling plan which best matches a desired sampling plan is 
selected. 

Unfortunately, in using the ANSI/ASQ Z1.9 standard, the user may be bound 
to those sampling plans that are set forth in the standard. In particular, specific values 
of error rates, power, sample size and other parameters may be forced upon a user 
20 because the tables may not include the exact criteria that are desired by a given user. 

Moreover, because the ANSI/ASQ Z1.9 standard uses test procedures that are 
based on a non-central t distribution, it may be difficult for the user to interpolate or 
extrapolate between tables of the standard. Notwithstanding these difficulties, the 
ANSI/ASQ Zl .9 standard continues to be widely used in constructing sampling plans 
25 for items that are manufactured. 



Summary of the Invention 

It is therefore an object of the present invention to provide improved systems, 
methods and computer program products for constructing sampling plans for items 
30 that are manufactured. 



- 1 . 



It is another object of the present invention to provide systems, methods and 
computer program products for constructing sampling plans that can be flexible to 
meet the needs of a particular user and manufacturing process. 

These and other objects are provided, according to the present invention, by 
5 inputting into a computer a desired Acceptable Quality Limit (AQL), a desired Key 
Defect Rate (KDR), a desired power of a sampling plan for the items that are 
manufactured and a desired false alarm rate for the sampling plan. The computer then 
calculates a required sample size to provide the desired AQL, the desired KDR, the 
desired power of the sampling plan for the items that are manufactured and the 
10 desired false alarm rate for the sampling plan. Thus, each of the individual 

parameters may be independently specified based on the items that are manufactured, 
desired AQLs, KDRs, power and false alarm rates. Reliance on ANSI/ASQ Z1.9 
tables which might best fit a user's desired parameters can be reduced and preferably 
eliminated. 

15 In addition to calculating the required sample size, a decision rule critical 

value also may be calculated based upon the required sample size to provide the 
desired AQL, the desired KDR, the desired power and the desired false alarm rate for 
the sampling plan. Following the calculations, a relationship between sample size, 
acceptable number of defective items and false alarm rate automatically may be 

20 graphically displayed based upon the desired AQL, the desired KDR and the desired 
power of the sampling plan. 

The items that are manufactured may then be sampled at the required sample 
size to obtain samples, and the number of defective items in the samples or other 
response variables in each of the samples, may be measured. After measuring the 

25 response variables, such as the number of defective items, the measured response 
variable for each of the samples is input into the computer and an estimate of the 
Quality Level (QL) for the items that are manufactured is calculated, based on the 
measured response variable for each of the samples. 

Prior to calculating the required sample size and the decision rule critical 

30 value, a sample distribution that is variance invariant may be calculated based on a 
normal distribution. A percentile grid of sample size and a true process defect rate is 
formulated based on estimated percentiles of a cumulative distribution of the 
sampling distribution. A bias-corrected percentile grid of sample size and the true 



process defect rate is then formulated from the percentile grid. The bias-corrected 
percentile grid is stored in the computer. 

The bias-corrected percentile grid may be used to compute the decision rule 
critical value from the AQL and the false alarm rate, across a plurality of sample 

5 sizes. The bias-corrected percentile grid is evaluated for the decision rule critical 
value, to determine the required sample size. More particularly, the decision rule 
critical value is computed from the AQL, the false alarm rate and the desired sample 
size using the bias-corrected percentile grid of sample size and a true process defect 
rate. The bias-corrected percentile grid is evaluated for values that are larger than the 

1 0 AQL, with the percentile being the desired power. 

After the measured response variable for each of the samples is input into the 
computer, an estimate of the QL may be calculated by computing a bias correction 
coefficient. A QL test statistic is computed as a function of the bias correction 
coefficient and at least one quantile from a cumulative distribution function of a 

15 central t distribution, with at least one argument that is a function of a sample mean, a 
sample standard deviation, the sample size and a specification limit. The computer 
automatically can determine whether the QL test statistic is at least equal to the 
decision rule critical value. 

In another aspect of the invention, after the measured response variables are 

20 input to the computer, the computer calculates a point estimate of the number of out- 
of-specification items that are manufactured based on the measured response variable 
for each of the samples. 

In addition to calculating a required sample size as was described above, the 
present invention also may be used to calculate a KDR that is produced from a desired 

25 sample size. In particular, a desired sample size, a desired false alarm rate, a desired 
AQL and a desired power are input into a computer. The computer calculates a KDR 
that is produced from the desired sample size, the desired false alarm rate, the desired 
AQL and the desired power of the sampling plan for the items that are manufactured. 
Thus, given a desired sample rate, a KDR may be calculated. 

30 The KDR may be calculated by computing a decision rule critical value based 

on the desired AQL and the desired false alarm rate for the desired sample size. After 
the KDR is calculated, a relationship between acceptable number of defective items 
and false alarm rate may be graphically displayed based on the desired AQL, the 
desired KDR and the desired power of the sampling plan. 



As described above, after the calculation is made, the items may be sampled at 
the desired sample size to obtain samples, and a KDR of the items that are 
manufactured may be determined from the samples. After the items are sampled, the 
measured response variable for each of the samples may be provided to the computer, 
5 and the computer can calculate an estimate of the KDR, so that the estimate of the 
KDR can be compared to the KDR that was calculated. After providing the computer 
with the measured response variable for each of the samples, a point estimate of a 
process defect rate for the items that are manufactured also may be calculated based 
on the measured response variable for each of the samples. 

10 The present invention may be embodied in one or more spreadsheets with an 

efficient user interface. The spreadsheets may be used in lieu of the ANSI/ASQ Zl .9 
standard, to allow flexible sampling plans to be constructed and the results of 
sampling to be measured without the need to fit the desired sampling plan to conform 
to one of the series of charts in the ANSI/ASQ Z1.9 standard. It will be understood 

1 5 that the present invention may be embodied as systems, methods and/or computer 
program products. 

Brief Description of the Drawings 

Figure 1 is a simplified block diagram of systems, methods and computer 
20 program products for constructing sampling plans for items that are manufactured 
according to the present invention. 

Figure 2 is a flowchart illustrating operations for calculating a required sample 
size according to the present invention. 

Figure 3 is a flowchart illustrating operations for determining a bias-corrected 
25 percentile grid according to the present invention. 

Figure 4 is a flowchart illustrating operations to calculate an estimate of 
quality level according to the present invention. 

Figure 5 is a flowchart illustrating operations for calculating a KDR according 
to the present invention. 
30 Figures 6A-6F graphically illustrate histograms of Monte Carlo replicates that 

indicate the shape of a probability distribution function for p . 

Figure 7 graphically illustrates bias as a function of sample size, when 
p=AQL. 



Figures 8A and 8B illustrate bias curves across various sample sizes for an 
AQL of 0.0 1 and 0. 1 0, respectively. 

Figure 9 A graphically illustrates bias correction regression for hypothesis 
testing, for n-5. 

5 Figure 9B graphically illustrates bias correction regression for QL estimation, 

forn=5. 

Figure 10 is a screen shot of an attributes work sheet according to the present 
invention. 

Figure 1 1 is a screen shot of the work sheet of Figure 10, for Example 1. 
10 Figure 12 is a screen shot of an attributes data initial spreadsheet, for Example 

2. 

Figure 13 is a screen shot of an attributes data final spreadsheet, for Example 

2. 

Figure 14 is a screen shot of a variables spreadsheet according to the present 
15 invention. 

Figure 15 is a screen shot of a variables data spreadsheet, for Example 3. 
Figure 16 is a screen shot of a variables data spreadsheet 14, for Example 4. 
Figure 17 is a screen shot of a variables data analysis spreadsheet according to 
the present invention. 

20 Figure 1 8 is a screen shot of a variables data analysis spreadsheet, for Example 

5. 

Detailed Description of Preferred Embodiments 

The present invention now will be described more fully hereinafter with 
25 reference to the accompanying drawings, in which preferred embodiments of the 
invention are shown. This invention may, however, be embodied in many different 
forms and should not be construed as limited to the embodiments set forth herein; 
rather, these embodiments are provided so that this disclosure will be thorough and 
complete, and will fully convey the scope of the invention to those skilled in the art. 
30 Like numbers refer to like elements throughout. 

The present invention can provide a simple, flexible tool to a non-statistical 
audience that can allow a user to both construct an appropriate variables sampling 
plan to meet desired needs, as well as to analyze the data once it is collected. 
Specifically, the present invention can allow the user to prescribe the Acceptable 



Quality Limit (AQL), Key Defect Rate (KDR), the power of the test, and the level 
(also referred to as either a type I error or the false alarm rate) of the test. These four 
values may then be used to determine the required sample size as well as the critical 
value for the test statistic. 
5 Moreover, according to the invention, the user can specify a sample size, and 

given the AQL, power and false alarm rate, determine the resulting KDR. This aspect 
of the invention can enable the user to evaluate the adequacy of a specific sample size 
for the particular items being manufactured. 

The above-described parameters may be calculated, according to the 
10 invention, using a central t distribution and extensive Monte Carlo simulation that can 
provide the user with a flexible, efficient, non-tabulated tool to monitor and maintain 
the AQL of a process via acceptance sampling. 

The sampling plans may be constructed using Microsoft® Excel workbooks or 
other spreadsheets, so that special statistical software need not be used and the wide 
15 range access of users to Microsoft Office software may be leveraged. 

In the Detailed Description that follows, a General Overview of the present 
invention will first be provided. Then, a Detailed Theoretical Description will be 
provided. Finally, a Detailed Description of a User Interface for the present invention 
will be provided. 

20 

1. General Overview 

Referring now to Figure 1 , a simplified block diagram of the present invention 
is shown. As shown in Figure 1, systems, methods and computer program products 
for constructing a sampling plan for items that are manufactured, according to the 

25 invention, may be provided using a personal computer including a Central Processing 
. Unit (CPU) 100, a keyboard 110, a mouse 120 and/or other user input devices, and a 
display 130 and/or other user output devices. The CPU 100 may include an operating 
system 140 and a spreadsheet 150. Sampling plans 160 according to the present 
invention may be included, for example, as spreadsheet workbooks. 

30 In a preferred embodiment of the present invention, the CPU 100 may be a 

standard IBM-compatible, Apple or other personal computer, the operating system 
may be Microsoft Windows and the spreadsheet 150 may be Microsoft Excel. 
However, it will be understood that other general purpose computers, including 
mainframe, midrange, workstation and/or personal computers and/or one or more 



applications running on one or more of these computers, and/or special purpose 
hardware may be used. 

Various aspects of the present invention are illustrated in detail in the 
following figures, including flowchart illustrations. It will be understood that each 

5 block of the flowchart illustrations, and combinations of blocks in the flowchart 

illustrations, can be implemented by computer program instructions. These computer 
program instructions may be provided to a processor or other programmable data 
processing apparatus to produce a machine, such that the instructions which execute 
on the processor or other programmable data processing apparatus create means for 

10 implementing the functions specified in the flowchart block or blocks. These 

computer program instructions also may be stored in a computer-readable memory 
that can direct a processor or other programmable data processing apparatus to 
function in a particular manner, such that the instructions stored in the computer- 
readable memory produce an article of manufacture including instruction means 

1 5 which implement the functions specified in the flowchart block or blocks. 

Accordingly, blocks of the flowchart illustrations support combinations of means for 
performing the specified functions, combinations of steps for performing the specified 
functions and program instruction means for performing the specified functions. It 
also will be understood that each block of the flowchart illustrations, and 

20 combinations of blocks in the flowchart illustrations, can be implemented by special 
purpose hardware-based computer systems which perform the specified functions or 
steps, or by combinations of special purpose hardware and computer instructions. 

Prior to providing a general overview of the present invention, an overview of 
terms, symbols and abbreviations common to this statistical field are listed in Table 1 : 



Table 1 



Term/Symbol/ 
Abbreviation 


Meaning 


Defect 


Any item that does not meet ail relevant acceptance criteria 


AQL 


Acceptable quality limit. The numerical value for the process defect rate used to 
calibrate the false alarm rate of the sampling plan 


KDR 


Key defect rate. The smallest value for the process defect rate that will be detected 
with a preset "high probability" when using the sampling plan. The numerical value 
for the process defect rate used to calibrate the power of the sampling plan 


Ho 


Null hypothesis. The statement being tested. Usually, this claim contains an 
"equality". 


H, 


Alternate hypothesis. A statement in opposition to the to the null hypothesis. Usually, 
this is the claim hoped to be proven; hence it is sometimes referred to as the Research 
hypothesis. In quality settings, this is usually connected with a process shift that must 
oe aeiccieu. 


a 


Type I error. The probability of rejecting a lot when the process defect rate equals 
AQL. Also called the false alarm rate. 


ft 
P 


i ypc ii ciiur. iiic pruoiiDiiiiy kji ovi-cpiing a ioi wiicii nic proccsb ucieci rate equals 
KDR. 


power 


The probability of rejecting a lot when the process defect rate equals the KDR. 


n 


The sample size. 


P 


i ne irue, aioeu unKiiuwn, proceab ueieci rate. 


left-sided 


A setting where only a lower specification limit is used. 


right-sided 


A setting where only an upper specification limit is used. 


two-sided 


A setting where both lower and upper specification limits are used. 


OC curve 


Operating Characteristic Curve. A plot of the relationship between the type II error for 
uie iebi ^vcixicdi axibj duu uie cieieci raie vjiurizoiiuti dxiii j. 


AOQ curve 


Average outgoing Quality Curve. A plot of the relationship between the long-term 

r\i if rrr\in <x H ft ratf* ( \ff*rt\r , a 1 qyic^ gnr) tVir* Hf*fV»r*f mtf* CY\t\rt'7f\r\t^\ civic 
UUlgUUlJ* lalC ^VClllVal aXlbJ allU U1C UCICCl Ialv ^IIUI 1Z.UI1UU dAIb 


AOQL 


Average Outgoing Quality Limit. The maximum long-term expected outgoing defect 
rate. 


Attributes Data 


/• . . 


X 


The number of defective items in the sample. 


c 


The acceptable number of defective items in the sample. Also called the decision rule 
critical value. 


Variables Data 


P 


The estimate for the process defect rate. Also called the quality level, or QL, estimate. 


k L 


The lower specification limit for the response variable in original units. 


ku 


The upper specification limit for the response variable in original units. 


K 


The decision rule critical value. 


y 


The mean, or average, of the sample data. 


s 


The standard deviation of the sample data. 



It will be understood by those having skill in the art that Variables Data relates to 
continuous measurements of various parameters, whereas Attributes Data relates to 
integer values of actual results. 
5 The present invention may be used to determine a required sample size and 

also to determine a KDR. Figure 2 is a flowchart of operations for calculating a 
required sample size according to the present invention. In general, a bias-corrected 
percentile grid is determined and input into the computer at Block 210. At Block 220, 
the desired AQL, KDR, power and false alarm rate are input into the computer. At 

10 Block 230, a decision rule critical value is computed based upon the required sample 
size to provide the desired AQL, the desired KDR, the desired power and the desired 
false alarm rate. The decision rule critical value preferably is determined across a 
plurality of sample sizes using the bias-corrected percentile grid of sample size and a 
true process defect rate. Then, the bias-corrected percentile grid is evaluated for the 

15 decision rule critical value to determine the required sample size. 

In particular, the decision rule critical value, K, is computed based on the 
numerical values for AQL and a, across several potential sample size values. For a 
selected sample size, the bias corrected percentile grid is used to obtain the numerical 
value for K, where p equals AQL and the percentile used is 1 -a. Interpolation may be 

20 used when necessary. Given K, the bias corrected percentile grid is evaluated with p 
= KDR and the percentile equaling the power. This second grid result should equal K 
for one particular sample size. Again, interpolation may be used when necessary. 

Continuing with the description of Figure 2, at Block 250, the relationship 
between the sample size, acceptable number of defective items and false alarm rate 

25 optionally is displayed. Other relationships also may be displayed in a graphical 

and/or table format. At Block 260, the items are sampled and the response variable is 
measured. Finally, at Block 270, the measured response variable for each of the 
samples is input into the computer and a calculation is made of an estimate of a 
quality level (QL) of items that are manufactured, based on the measured response 

30 variable for each of the samples. 

Referring now to Figure 3, details of determining the bias-corrected 
percentage grid (Block 210 of Figure 2) will now be described. At Block 310, a 
sampling distribution that is variance invariant is calculated. It is assumed that the 



response variable of interest from a process possesses a normal distribution with mean 
\i and variance a 2 . It also is assumed that the response variable is given certain 
specification limits, namely a lower specification limit, Icl, and/or an upper 
specification limit, ky. The true underlying proportion, p, of non-conforming (i.e. 
out-of-specification or defective) items produced by the process is a function of both 
\i and <r 2 . Specifically, any of these three parameters can be solved for given 
numerical values for the other two parameters. A preliminary Monte Carlo study 
concluded that for a fixed value of p, the sampling distribution for the (left-sided case) 
statistic 

/ > \ 



10 p = P 



r < 



pi 



with T* ~ t B _ 



is invariant to the selection of a 2 . The right-sided and two-sided cases follow 
analogously. 

The mathematical form of the distribution characterizing p is presently 
unknown, except by a vague and very complex expression. However, Monte Carlo 

1 5 simulation may be used to estimate the percentiles of the cumulative distribution for 
p , as well as to estimate the expected value for p . Both are functions of the sample 
size, n, as well as the true process defect rate, p. Therefore, at Block 320, a percentile 
grid is established in three dimensions: (1) the sample size, (2) the true process defect 
rate, and (3) the percentile to be referenced. 

20 Unfortunately, the point estimate p is a biased estimate, specifically an 

overestimate, of the true process defect rate p. However, p is asymptotically 
unbiased; that is, the bias goes to zero as the sample size gets large. For each value of 
p, a regression model of the bias as a function of the sample size may be used to 
determine the multiplication constant needed to eliminate the bias. Thus, at Block 

25 330, a bias corrected percentile grid is constructed by taking the Monte Carlo based 
percentile grid and multiplying each entry by an appropriate constant. 

Referring now to Figure 4, detailed operations to calculate an estimate of 
quality level (Block 270 of Figure 1) will now be described. In particular, a bias- 
correction coefficient, based on the assumption that p — AQL, is computed at Block 

30 410. The QL test statistic is then computed at Block 420 as a function of this bias- 



correction coefficient and quantile(s) from the cumulative distribution function of the 
central t-distribution with an argument(s) that is/are a function of the sample mean, 
sample standard deviation, sample size, and specification limit(s). Finally, at Block 
430, the decision rule is defined as follows: reject the hypothesis that the process is 
5 operating under the condition of an AQL defect rate if and only if the QL test statistic 
is greater than or equal to the decision rule critical value, K. Otherwise, the process is 
deemed to be operating under an acceptable defect rate. 

Referring now to Figure 5, according to another aspect of the present 
invention, the KDR that is produced from a desired sample size, a desired false alarm 

10 rate, a desired AQL and a desired power, may be calculated. As shown in Figure 5, a 
bias-corrected percentile grid may be calculated at Block 210, as was described in 
connection with Figures 2 and 3. It will be understood that the calculation of the bias- 
corrected percentage grid may only need to be performed once and then may be used 
for all subsequent evaluations and calculations. Then, at Block 510, an acceptable 

15 AQL, a desired false alarm rate (a), a desired sample size (n), and a desired power at 
the unknown KDR are input into the computer. 

At Block 520, the decision rule critical value is computed based on the desired 
AQL and the desired false alarm rate for the desired sample size. In particular, the 
decision rule critical value, K, based on the numerical values for AQL, and a is 

20 computed for the given sample size. Hie bias corrected percentile grid is used to 

obtain the numerical value for K, where p equals AQL and the percentile used is 1-a. 
Interpolation may be used when necessary. Given K, the bias corrected percentile 
grid is now evaluated across several values of p > AQL, with the percentile equaling 
the power. This second grid result should equal K for one particular value of p, which 

25 becomes the computed value for the KDR. Again, interpolation may be used when 
necessary. 

Continuing with the description of Figure 5, optionally at Block 530, a 
relationship between acceptable number of defective items and false alarm rate may 
be displayed. Alternatively, other relationships may be displayed. 
30 Then at Block 260, the items are sampled and the response variables are 

measured as was described in connection with Figure 2. Finally at Block 540, a KDR 
that is produced from the desired sample size, the desired false alarm rate, the desired 
AQL and the desired power of the sampling plan is calculated. More particularly, an 



estimate of the KDR is calculated for the items that are manufactured based on the 
measured response variable for each of the samples. Moreover, a point estimate of a 
process defect rate for the items that are manufactured may be calculated based on the 
measured response variable for each of the samples. These parameters may be 
displayed. 

In particular, to obtain a point estimate for the process defect rate, an alteration 
of the QL test statistic may be performed. This is due to the form of bias-correction 
coefficient, which no longer may utilize the assumption that p = AQL that results 
from the formal hypothesis test. A family of reference regression models (each 
defined by the sample size) of the true defect rate versus the expected value of p is 
constructed based on the Monte Carlo results that produced the original percentile 

A 

grid. The QL point estimate, QL , is a transformation of p . For the given sample 
size, the coefficients from the appropriate reference regression model may be used to 

A 

transform p into QL . Interpolation may be used as necessary. 

2. Detailed Theoretical Description 

A detailed theoretical description of the statistical theory underlying the 
present invention now will be provided. 

The process parameter of interest is the probability, p, that an individually 
produced item is not within specifications. Hence, p is referred to as the process 
defect rate. Generally, it is only desirable to protect against an excessively large value 
for p. The formal hypothesis statement takes the form of a one-sample proportion 
test, namely 

H 0 : p = po 
Hi:p = pi>po 

where po equals the acceptable quality level, or AQL. The false alarm rate, a, for this 
test is controlled at a specific level under the circumstance that p = AQL. Further- 
more, the power of this test is calibrated when the process defect rate equals a value 
referred to as the key defect rate, or KDR. Thus, p = KDR, where KDR - pi > po- 

There are two scenarios to be considered when constructing the sampling plan. 
In both cases the AQL, a, and power are fixed in advance. In the first case, the KDR 
is specified in advance and the required sample size is computed. In the second case, 



the sample size is specified in advance with the resulting KDR being computed. In 
either case, it is of interest to estimate the unknown process parameter p from the 
observed (collected) data during the data analysis. 



2.1 Test Procedure Developmental Theory 

Assume that a lower one-sided specification limit is employed. Let 
p = P( y < k L ) be a point estimate for p, where kL represents the lower specification 
limit and y represents any future observation. Assume that the response variable is 
normally distributed with mean \i and variance a 2 , written as y ~ n[ jj., a 2 ], and that 
the observations are all independent of one another. From distribution theory, with y 
being the sample mean and s being the sample standard deviation arising from a 
sample of n observations: 

2 



and 



So, 



Then, 



y-y~N 



n 



Xn-l * 



T* = 



y-y 




n[o,i] 



V n-l 



k n-l* 



-1 



P = P 



\ n) 



, with T* - t 



n-1' 



It is noted that the test statistic is the cumulative distribution function, or cdf, value of 
a t statistic at some specific value, but it is not a t statistic itself. 

The right-sided and two-sided cases may follow analogously. For the two- 
sided specification case, the test statistic is: 



r 




<T* < 



ku -y 




while for the right-sided specification case, the test statistic is: 



P-P 




5 In both cases, ku represents the upper specification limit. The sample statistic p may 
be the foundation on which to build a new test procedure. 

2.2 The Decision Rule 

A goal of the present invention is to provide a sample statistic that can truly 

1 0 estimate the process defect rate, not just some artificial statistic constructed via 

distribution theory, while not requiring relinquishing power as compared to the ANSI 
test procedure. A primary benefit can be a better understanding of the ramifications 
on product quality of a particular numerical value for the sample statistic. The 
hypothesis testing procedure chooses between the AQL and KDR values as to which 

15 is the more appropriate description of the underlying process defect rate. The critical 
value, K, for the hypothesis test falls between the AQL and KDR, where the value for 
K also can provide information concerning sampling plan precision (in meaningful 
terms) to the user. Specifically, the user can know that certain process defect rates 
above the AQL will likely be indistinguishable from the AQL. 

20 Overall, the hypothesis test is righWailed, with the statement "the process 

defect rate equals the AQL" being rejected whenever the sample statistic exceeds the 
critical value. Whether the actual specifications for the response variable themselves 
are one-sided or two-sided can be immaterial regarding this decision rule. 

The value of K is derived from the null distribution (i.e. assuming that 

25 p = AQL ) of the test statistic. The right-tailed region of this distribution having an 

area of a provides the appropriate value for K in order to obtain a hypothesis test at an 



a level of significance. To attain a specific power for the test, the alternate 
distribution of the test statistic, evaluated at p = KDR , is used. 

As stated above, either the KDR or the sample size may be fixed in advance. 
Given the KDR, the power determines the sample size, which in turn is used with the 
5 AQL and a to compute K. Alternately, given the sample size, K can be computed 
directly from the AQL and a. In this case, calculating the KDR for a specific power 
is useful in determining the adequacy of the sampling plan, but the value of the KDR 
may have no direct bearing on the decision rule. 

10 2.3 The Sampling Distribution of p 

To incorporate the sample statistic p into a formal hypothesis test, the 
sampling distribution for p is obtained. Unfortunately, the mathematics may become 
complex. Assuming a left-sided specification case, the cumulative distribution 
function, G, for p is: 



( kL-y 



15 G(w) = Jg 



ff(t)d. 



dz, 



where g(z) is the probability density function for p and f(t) is the probability density 
function of a t distribution having n-1 degrees of freedom. However, the form of the 
function g(z) is generally unknown. 

Without an expression (closed-form or otherwise) to define the sampling 

20 distribution for p , Monte Carlo simulation may be employed as one method of 

circumventing the complex distribution theory. There is, however, one assumption 
that should be met, as will now be described. 

Setting the process defect rate to some constant value has the effect of fixing 
the tailed area(s) under a normal curve to this constant value. As the normal 

25 distribution is a two-parameter (p., a 2 ) distribution, there are an infinite number of 

(H,cr 2 ) pairs that will satisfy this tailed area criterion. However, if is also fixed, then 
a 2 can be solved for explicitly, and vice versa. A result from a previous Monte Carlo 
simulation study is that the sampling distribution for p is invariant to the selection of 
the a 2 ) pair. For the sake of simplicity, a ? can therefore be arbitrarily set to 1 , 



with p. computed directly from the true process defect rate. The random normal 
variates used in the Monte Carlo simulation are then fully defined. 

Monte Carlo simulation produces an empirical sampling distribution for p 
that is a function of both the sample size and the true process defect rate, p. The 
5 100(l-a) th percentile of the empirical cumulative distribution function, or ecdf, for p 
can be used for determining K. 

To illustrate the sampling distribution of p graphically, consider two process 
defect rates (0.01 and 0.10) along with three sample sizes (5, 30, and 100). Figures 
6 A through 6F represent histograms of the Monte Carlo replicates (10,000 for each 

10 case) and indicate the shape of the probability distribution function for p . It appears 
as though smaller sample sizes exhibit more right-skewedness than do larger sample 
sizes. It also appears that as the process defect rate becomes larger (and fixing the 
sample size), symmetry and normality are more closely adhered. Overall, the 
suggestion is that the sampling distribution for p is unimodaL Given that p > 0, 

15 many continuous distributions, such as the Gamma and Weibull, could be investigated 
as to their potential use as approximations. Since, 0 < p < 1 is the real domain for p , 
a logical choice would be a Beta distribution. Further discussion concerning the 
modeling of the sampling distribution of p will be presented in Section 2.7.2. 

20 2.4 The Bias of 6 

Distribution theory suggests that p estimates p, but Monte Carlo simulation 
reveals an interesting fact: p is a biased estimator of p. This fact is illustrated in both 
Table 2A and Table 2B, where the mean of the Monte Carlo replicates for the three 
illustrative sample sizes are all above the true value for p (for either p - 0.01 or p = 
25 0.10). These outputs also show that the means approach the true value of p as the 

sample size increases. This means that p is an asymptotically unbiased estimator of 
P. 



Table 2 A: 



Descriptive Statistics for Monte Carlo Replicates when p = 



0.01. 





Variable 


N 


Mean 


Median 


TrMean 




StDev 


SE Mean 


5 


n=5 


10000 


0.05367 


0.04165 


0.04952 


0. 


04648 


0.00046 




n=30 


10000 


0.01685 


0.01422 


0.01589 


0. 


01196 


0.00012 




n«100 


10000 


0.01196 


0.01107 


0.01165 


0. 


00559 


0.00006 




Variable 


Minimum 


Maximum 


Ql 


Q3 








10 


n-5 


0.00001 


0.31664 


0.01726 


0.07769 










n-30 


0.00001 


0.09197 


0.00795 


0.02305 










n«100 


0.00078 


0.04245 


0.00781 


0.01518 









Table 2B: Descriptive Statistics for Monte Carlo Replicates when p = 0.10. 

15 



20 



Variable 


N 


Mean 


Median 


TrMean 


StDev 


SE Mean 


n=5 


10000 


0.15066 


0.13736 


0.14530 


0.09877 


0.00099 


n=30 


10000 


0.10936 


0.10614 


0.10820 


0.04161 


0.00042 


n-100 


10000 


0.10272 


0.10190 


0,10236 


0.02331 


0.00023 


Variable 


Minimum 


Maximum 


Ql 


Q3 






n-5 


0.00002 


0.80619 


0.07248 


0.21287 






n=30 


0.00779 


0.26957 


0.07905 


0.13589 






n=100 


0.03693 


0.19259 


0.08628 


0.11825 







25 

The bias is positive, meaning that on average, p will estimate the process 
defect rate as being slightly higher than it truly is, with the quantification of slight 
being a function of the sample size and, to a lesser degree, the true process defect rate. 
Therefore, p may be construed as being a conservative estimate of p. Figure 7 shows 

30 the bias surface across both the sample size and the true process defect rate (here, 
p = AQL). In Figures 8 A and 8B the bias trend is illustrated across various sample 
sizes for a fixed process defect rate. 

In terms of the hypothesis testing framework, there do not appear to be any 
procedural difficulties stemming from the bias of p . The false alarm rate and power 

35 can define the appropriate tailed regions of the null and alternate sampling 

distributions, respectively, and a sampling plan can be constructed. When performing 
the data analysis, however, an estimate of the observed process defect rate should be 
available. In this light, improving p is desirable, especially when dealing with small 
sample size scenarios. 
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2.5 Bias Correction Regression 

For a fixed sample size, the relationship between the true value for p and the 
expected value for p will be described. It is determined that the relationship between 
loglo(E n p [p ]) and loglO(p ) can be expressed with a cubic polynomial linear 

5 model. Figure 9A illustrates this relationship for a sample size of n = 5. The r-square 
is roughly 100%, with all three model terms being significant. Hence, there may be 
no need for overfitting. This high degree of fit may be attained for all sample sizes. 

In the hypothesis testing framework, the true value for p is assumed to be 
either the AQL or the KDR when determining either the null distribution or the 
10 alternate distribution. Therefore, the assumed true value for p can be used as the 
independent variable in the bias correction regression, and the expected value for p 

can be predicted. The sample statistic , 1 p then becomes an unbiased test 

E„,pLp] 

statistic, where p = AQL when dealing with the null distribution and p = KDR when 
dealing with the alternate distribution. 

15 A different problem presents itself when the post-sampling data analysis is 

performed. When estimating the process defect rate, the bias correction regression is 
reversed. Since p is unknown, it may not be used as an independent variable. Using 
the sample statistic p as the independent variable E n p [p ], the true value for p can be 
estimated. Figure 9B shows that a cubic polynomial linear model (again with dual 

20 logl 0 transformations) performs well here as well. 

There may be one inherent difficulty with the second bias correction 
regression technique: the distribution of p is not symmetric, thus replacing E n p [p ] 

by p may not eliminate the bias, only help to reduce it. 

25 2.6 The Percentile Grid 

The Monte Carlo simulation study to estimate the sampling distribution for p 
involves a rather extensive set of (n, p) pairs. Specifically, the sample sizes included 
in the study are n = 3,5(5)400(10)1000(200)2400,2500 . The process defect rates 
included in the study are p - 0.0005, 0.001, 0.004, 0.0065, 0.01(0.01)0.15. These 

30 sample sizes and process defect rates are fully crossed, with 10,000 Monte Carlo 



replicates of p computed for each (n, p) pair. This large number of replicates is 
compressed into a set of percentiles for each (n, p) pair, using these percentiles: 
{0.000 1, 0.0005, 0.00 1, 0.005, 0.0 1(0.0 1)0.99, 0.995, 0.^99, 0.9995, 0.9999} 
These uncorrected sets of percentiles are then filtered through an appropriate (a 
5 function of the sample size) bias correction regression equation to produce the final 
sets of corrected percentiles, called the percentile grid because each row represents an 
(n, p) pair and each column a particular percentile. 

A large number of (n, p) pairs were used for two primary reasons. First, the 
capturing of subtle curvilinear trends is desired, especially since the problem at hand 
1 0 mostly involves the extremes, or tails, of sampling distributions. Second, with a fine 
resolution grid, a linear interpolation procedure should provide an acceptable 

i. 

computational method for addressing any value for n, p, or percentile not represented 
explicitly by the grid. Other smoothing procedures may work as well, but the trends 
being smoothed generally are monotone and the smoothing procedure may need to 
15 account for this. As a counterexample, local quadratic fitting may not guarantee a 
monotone fit. 

Regarding the linear interpolation procedure, the problem is essentially three- 
dimensional. There is the sample size, the process defect rate, and the appropriate 
percentile (used to represent either the false alarm rate or power). The linear 

20 interpolation procedure used in the Excel workbooks described below incorporates a 
nearest neighbor weighted average computation. To accomplish this, a 2 3 factorial 
cube of upper and lower grid bounds is determined to surround the point of interest. 
Each dimensional range is scaled to 1, then a Euclidean distance is computed for each 
of the cube corner points to the point of interest. Higher weights should be given to 

25 the smaller Euclidean distances. This is accomplished through the use of a harmonic 
mean expression of the distances (which includes a constraint to have the weights sum 
to one). Finally, a weighted average is calculated. 

2.7 Distributional Approximations 
30 The percentile grid method of estimating the sampling distribution of p may 

be considered an inelegant, brute force method. However, it may be generally 
desirable to have an accurate representation over an elegant representation (if the cost 
is accuracy). Several different methods may be used to include more elegant 
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distributional representations of the various items of interest, 
summarized in the next two sections. 



These methods are 



2.7.1 Modeling the Critical Value and the Power 

A consideration was made of modeling the critical value, K, as a nonlinear 
function of the sample size, the AQL and the false alarm rate, and to model the power 
of the test as a function of the sample size, the AQL and the KDR. From a modeling 
standpoint, both regression models were highly successful. The general regression 
model for the critical value was found to be 

K~p o exp{-p 1 n P2 } + p 3 Vn-fp 4 n, 
with the last two terms helping to stabilize the asymptotic trend. Because the power 
of the test is bounded between 0 and 1 , there is a definite need to use a model that has 
a horizontal asymptote at a power equal to one. This leads to the statistical model for 
the power of the test being given by 

1 

power 



l + Y 0 exp{-Yiii Yl } 
Given the (AQL, KDR, a) triple, the sample size is found via 

1 -power 
Y 0 power J 



n = 



-± to 



v 



, where Y 0 , y, , y 2 ~ f (\QL, KDR, a), 



T, 

with the function f representing the dependence of the coefficients on the values for 
AQL, KDR, and a. Once the sample size is found, the critical value, K, is computed 
via 

K = p o exp{-p i n fj2 } + p 3 V^ + P 4 n, where P 0 , ft, (5 2 , P 3 ,p 4 ~ g(AQL, a), 
with the function g representing the dependence of the coefficients on the values for 
AQL and a. 

There may be two major difficulties with this approach, First, the false alarm 
generally rate adds another dimension to the modeling problem. Second, a different 
model may be needed for each potential value of KDR - AQL. Together, these may 
lead to an enormous reference grid of coefficients. A solution to the problem may be 
to find a unified model for the critical value, and another unified model for the power. 
This means that the coefficients themselves are modeled as functions of the sample 
size, the AQL and the false alarm rate (in the critical value case), or the sample size, 



the AQL, the false alarm rate and the KDR (in the power case). A unified model for 
the critical value was developed, but a unified power model was not developed. In 
any event, the noise produced through the estimation process of a unified model 
seemed to suggest that there would be some loss in accuracy. Hence, a different 
5 approach was warranted. 

2.7.2 Modeling With Known Continuous Distributions 

As stated in Section 2.3, the sampling distribution of p for a given (n, p) pair 
can be modeled with a continuous distribution. Even better may be the development 

10 of a unified model, where the coefficients themselves are a function of both n and p. 
The two-parameter beta distribution is a logical choice for modeling the 
sampling distribution of p because 0 < p < 1 . However, distributional analysis over 
several (n, p) pairs found that both the two-parameter Weibull distribution and the 
two-parameter Gamma distribution generally fit the data better than does the beta 

15 distribution. Even so, there are mixed results; sometimes the fit is very good, but 
other times the tails are not well represented. This is important because, due to the 
magnitude of both the false alarm rate and of the power, tail precision generally needs 
to be maintained. 

A three-parameter distribution is suggested as the minimally parameterized 
20 distribution that will adequately describe the sampling distribution of p . The three- 
parameter Weibull distribution and the generalized three-parameter beta distribution 
also may be considered. These distributions may include a true third parameter to aid 
in shape description, not just a "phase shift" parameter that moves the domain, but 
does not impact on the shape. 
25 Accordingly, the percentile grid presently provides a more accurate estimation 

algorithm than other continuous distribution representations. 

2.8 The PL Test 

To summarize Sections 2.1-2.7.2, the test statistic used to discriminate 
30 between the AQL and the KDR is defined to be 

AQL , 



QL 0 = 



E AQL „[p] 



where p is as defined in Section 2.1, and E AQL n [p ] is the predicted expected value 

for p as computed via the bias correction regression equation (conditioned on n) 
outlined in Section 2.5 under the assumption that p = AQL. . 

The decision rule is to reject the null hypothesis that the process defect rate 

A 

5 equals the AQL provided that QL 0 > K . The value of K is determined via the 
percentile grid, with linear interpolation used whenever necessary. 
The actual quality level point estimate is defined as: 

QL = io 0o+e,loelo( P^ Gl(Io ^ lo( P^ 2+83(!o8lo( ^ 
where 9 is a function of the sample size. As stated before, this is a conservative 
10 estimate of the process defect rate since, on average, the true process defect rate is 
slightly less than this estimated value. 

3. Detailed Description of User Interface 

The user interface preferably includes three customized Microsoft Excel 
15 workbooks, as follows: 

-Attributes Ms y creates sampling plans for attributes data. 

-Variables _SP.xls, creates sampling plans for variables data. 

-Variables _DA.xls ; analyzes variables data via the QL test. 
However, other spreadsheet programs or other applications programs or custom 
20 programs maybe used. 

Attributes data, being dichotomous in nature, are evaluated with respect to 
binomial distribution theory (assuming "infinite" lot sizes). This is consistent with 
the widely accepted methodology for producing sampling plans. 

Variables data preferably is analyzed via the QL test, as was described above. 
25 The usual normal distribution framework still applies, but an improvement in power 
and a vast improvement in flexibility may be provided over the ANSI standard 
approach. 

An overview of the hypothesis test framework as it pertains to either attributes 
data or variables data first will be presented. 

30 

3.1 Attributes Data 

The hypothesis test is of the form 



H 0 : p = AQL, 
Hi: p > AQL, 

with the lot being rejected if X > c . Following the binomial distribution for large lot 
sizes, the false alarm rate, or type I error, is determined from the relation 

n f n \ 

5 AQL*(l-AQL)"- x . 

With c fixed and (3 set for a prescribed KDR, the sample size is determined, regardless 
of the AQL, by solving the following expression for n: 

p = z["] KDRX ( i - KDR r x - 

Without a closed-form expression, the sample size can be computed in an iterative 
10 manner. 
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3.2 Variables Data 
The hypothesis test is of the form 
H 0 : p = AQL, 
Hi: p > AQL. 
For a left-sided specification case, the test statistic for the QL test is 

f \ 



p=p 



Af n J 



, withT'-t,,., 



It is noted that the test statistic is the cdf (cumulative distribution function) value of a t 
Statistic at some specific value, but it is not a t statistic itself. 
For the two-sided specification case the test statistic is 

f ' \ 

k L-y < K -y 



P = i^p 



si + i 
V V n 



'sfl 



withr~t n _,. 



Finally, for the right-sided specification case the test statistic is 



p=p 



T > 



K -y 



withT ~t n _ v 



The decision rule is one-sided regardless of whether the specifications are left-sided, 
right-sided, or two-sided. Namely, the lot is rejected if p > K . 

5 4. Overview of the Microsoft Excel Workbooks 

As stated above, three (3) Microsoft Excel workbooks may be provided: 

-Attributes _SP.xls ^ creates sampling plans for attributes data. 

- Variables SP.xls, creates sampling plans for variables data. 

-Variables _DA.xls y analyzes variables data via the QL test. 
1 0 There need be no Attributes_DA.xls file because there generally is no need for a data 
analysis spreadsheet when the attributes decision rule is simply whether or not the 
number of observed defectives exceeds some critical value. 

Section 5 contains detailed illustration of how various scenarios are addressed 
with these workbooks. 

15 

5. Detailed Description for Using the Microsoft Excel Workbooks 
5.1 Attributes_SP.xls 

Figure 10 shows the layout of the worksheet Attributes SP.xls. The control 
panel is set up in columns B through F. Three (3) charts appear in columns G through 
20 P: 

-OC Curve. (Upper Left) A plot of the probability of accepting a lot (vertical 
axis) against the process defect rate (horizontal axis). 

-AOQ Curve. (Upper Right) A plot of the outgoing defect rate (vertical axis) 
against the process defect rate (horizontal axis). 
25 -Maintaining the Key Defect Rate. (Bottom) A plot of the a error (vertical 

axis) against the critical value (horizontal axis). The data labels represent the 
sample size required to maintain fixed values for both the KDR and the power 
at KDR. 

As seen in Figure 10, only eight (8) cells can be changed. See Table 3: 

30 : 



Table 3 



Editable Cell 


Purpose 


D4 


User specified n 


D5 


User specified c 


D6 


AQL 


D7 


p error (becomes target when maintaining the KDR) 


D16 


User specified KDR 


E29 


Maximum defect rate for OC and AOQ curves 


E31 


Maximum critical value for Maintaining the KDR graph 


E32 


Nominal a error, required when maintaining the KDR 



A user has two options available for the construction of a particular sampling 

plan: 

Specify the sample size, critical cutoff value, AQL, and (3 error, 
then evaluate this sampling plan by viewing the resulting a 
error and KDR. The "QC curve" and "AOQ curve" can then be 
used to get a more general perspective of the sampling plan and 
determine if it is practically feaLsible and still functional. 

2. Specify the KDR, p error, AQL, a error (target value), and 
critical value, then determine the required sample size. It is 
recommended that initially the critical value be set to 0, with 
15 the "Maintaining the KDR" graph providing its own evaluation 

of the sample size and critical value pair that keeps the true a 
error as close to the targeted (or nominal) value as possible. 
The procedure for each of these two cases is outlined in the following two 
sections. To alleviate intermittent calculations from becoming a nuisance, the 
20 workbook has the manual calculation option selected. Therefore, the F9 key is used 
to recalculate the worksheet whenever necessary. 



1. 
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5.1.1 Attributes Data Sampling Plan: Unspecified KDR Case 

The first scenario considered is the evaluation of a user-specified sampling 
25 plan. The sample size is entered in cell D4, with the critical value in cell D5, the 
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AQL in cell D6, and the P error in cell D7. The resulting a error is displayed in cell 
D9, with the computed KDR found in cell Dl 0. 

Example 1 

5 Suppose that a user wants to evaluate a sampling plan involving a sample size 

of n = 50 items with a c = 0 acceptance criterion. The AQL is to be 0.65% and the 
test should provide 90% power. 

Results: Example 1 

1 0 Entering the four given values into cells D4:D7 as outlined above, it is seen 

that the a error is 27.82%, with the KDR being 4.50%. A maximum defect rate of 6% 
(slightly above KDR) is entered in cell E29 in order to provide a decent horizontal 
axis scale. The spreadsheet is recalculated using the F9 key. Figure 1 1 illustrates this 
spreadsheet, which now shows an AOQL of 0.73%. The Maintaining the KDR graph 

1 5 indicates a setup problem due to the empty D 1 6, B3 1 , and E3 1 cells. However, this 
scenario does not attempt to control the KDR, so this error message should be 
ignored. 

5.1.2 Attributes Data Sampling Plan: Specified KDR Case 

20 The second scenario considered is to solve for the sampling plan parameters 

(n, c) in order to maintain a specific KDR at a given power under the conditions of a 
prescribed AQL and associated a error. The AQL is entered in cell D6, with the p 
error in cell D7 and the KDR in cell D 1 6. In addition, the targeted a error is entered 
in cell E3 1 , with a maximum critical value in cell B3 1 and an initial critical value in 

25 cell D5. 

Example 2 

Suppose that it is desired to maintain a KDR of 3% with 95% probability. The 
false alarm rate is to be a = 10% when the process is at AQL = 1% . It is desired to 
30 compute the sampling plan parameters: namely the sample size, n, and critical cutoff, 
c. 



Results: Example 2 

Ignore cell D4, the sample size entry. Arbitrarily set c == 0 , thus entering "0" 
in cell D5. Then, enter "1" (AQL = 1%) in cell D6, followed by entering "5" 
CP = 5% ) in cell D7. Next, enter "3" ( KDR = 3% ) in cell D16. The targeted a error 

5 is entered as "10" in cell E3 1 , with a maximum critical value of "15" in cell B3 1 . 
Recalculate the spreadsheet using the F9 key. From this, it is seen in Figure 12 
(viewing cell D17) that n = 99 is needed. However, under these conditions, the false 
alarm rate is highly unacceptable (see cell D23), as a = 63% . Lowering n would 
lower a but increase the KDR. Increasing n will increase a. Therefore, c should be 

10 increased. 

To avoid forcing the user into performing an iterative search, the Maintaining 
the KDR chart is consulted. It is seen that the recommended sampling plan has 
parameters n=392 and c=6. 

It will be understood that in Figure 12, the OC curve and AOQ curve indicate 

15 setup problems, which are also signaled with warning lights for cells D4 and E29. 
Entering a "392" into cell D4 and a "6" into cell E29, along with a maximum defect 
rate (here use 6%, so enter a "6") in cell E29, finalizes the procedure. Lowering the 
maximum critical value to "7" in cell B3 1 would reduce the computation time needed 
for the recalculation (the F9 key), although this is not required. Figure 13 displays the 

20 final spreadsheet layout for this example. Here it is seen that the AOQL is 0.97%. 

While a sample size of 392 may appear large, examine the constraints of the 
problem. The difference between AQL and KDR is only 2%, and higher precision 
generally means higher sample size. In addition, with both AQL and KDR being 
srnall numerically, the scenario is also affected by the rare events identification 

25 problem. This situation also may require larger sample sizes. 

5.2 Variables SP.xls 

Figure 14 shows the layout of the variables data sampling plan worksheet 
Variables_SP.xls. The control panel is set up in columns B through J. Two (2) charts 
30 appear in columns L through Q: 

-OC Curve. (Upper Left) A plot of the probability of accepting a lot (vertical 
axis) against the process defect rate (horizontal axis). 
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-AOQ Curve. (Upper Right) A plot of the outgoing defect rate (vertical axis) 
against the process defect rate (horizontal axis). 

As seen in Figure 14, only seven (7) cells can be, changed. See Table 4. 



Table 4 



Editable Cell 


Purpose 


F4 


AQL 


F5 


a error 


F6 


power 


E9 


User specified n 


G9 


User specified KDR 


C18 


Maximum defect rate for OC arid AOQ curves 




Case to plot(l or 2) 


C19 


1 = prescribe n 




2 = prescribe KDR 
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A user has two options available when constructing a particular variables data 
sampling plan: 

1 . Specify the KDR, p error, AQL, and a error, then determine 
the required sample size. The critical value is determined as a 
function of AQL and a. 



2. Specify the sample size, AQL, and a error, and power, then 
evaluate this sampling plan by viewing the resulting KDR. 
1 5 Again, the critical value is determined as a function of AQL 

and a. 

The OC curve and AOQ curve can be used to critique the resulting sampling 
plan for practical feasibility and functionality. The procedure for each of these two 
cases is outlined in the following two sections. Unlike Attributes_SP.xls, the 
20 workbook VariablesJSP.xls does not have the manual calculation option selected, so 
the F9 key is not needed here. 



5.2.1 Variables Data Sampling Plan: Unspecified KDR Case 

Consider now the evaluation of a user-specified variables data sampling plan. 
25 The AQL is entered in cell F4, with the a error in cell F5 and the power in cell F6. 



The sample size is entered in cell G9. For plotting purposes, a maximum defect rate 
is; entered in cell CI 8 and the case identifier "2" is entered in cell CI 9. 

Example 3 

5 Suppose that the user wants to evaluate a sampling plan involving a sample 

size of n = 50 items. The AQL is to be 0.65% with a false alarm rate of a = 5% , and 
the test should provide 90% power. 

After entering a "0.65" in cell F4, a "5" in cell F5, a "90" in cell F6, and a 
"50" in cell G9, the resulting KDR of 4.025% is displayed in cell GIL Also, the 
10 critical value of 2.263% is displayed in cell G12. With the KDR being 4.025%, a 
maximum defect rate of 5% seems reasonable for the two plots. Thus, a "5" is 
entered in cell C18, with the case identifier "2" entered in cell C19. Figure 15 shows 
the final spreadsheet layout for example 3, where it is seen that the AOQL is 1.02%. 

1 5 5 .2.2 Variables Data Sampling Plan: Specified KDR Case 

Next, consider the construction of a variables data sampling plan when the 
KDR is fixed in advance. Also given is the power at the KDR, the AQL, and the a 
error. The AQL is entered in cell F4, with the a error in cell F5 and the power in cell 
F6. The KDR is entered in cell E9. For plotting purposes, a maximum defect rate is 

20 entered in cell CI 8 and the case identifier "1" is entered in cell CI 9. 

Example 4 

Suppose that it is desired to obtain 95% power at KDR = 3% , with the false 
alarm rate being a = 5% when the process is at AQL = 1% . It is desired to compute 
25 the sampling plan parameters: namely the sample size, n, and critical value, K. 

Results: Example 4 

After entering a "1" in cell F4, a "5" in cell F5, a "95" in cell F6, and a "3" in 
cell E9, the resulting sample size of 174 is displayed in cell El 1 . Also, the critical 
30 value of 1.86% is displayed in cell El 2. With the KDR being 3%, a maximum defect 
rate of 5% seems reasonable for the two plots. Thus, a "5" is entered in cell CI 8, with 
the case identifier "1" entered in cell CI 9. Figure 16 shows the final spreadsheet 
layout for example 4, where the AOQL is 1 .05%. 

on 



It is the ease at which Example 4 is performed that illustrates the gains in 
flexibility that the QL test procedure has over the ANSI standard approach. The next 
section deals with the analysis of variables data when using the QL test procedure. 

5 5.3 Variables DA.xls 

The general spreadsheet layout of the variables data analysis workbook 
Variables_DA.xls is shown in Figure 17. The data analysis summary is self-contained 
within one graph. Annotation is provided to report the particulars of the hypothesis 
test, the point estimate for the quality level, and an explicit conclusion. 

10 As seen in Figure 1 7, there are eight (8) editable cells devoted to the setup of 

the problem, and one long editable data field. See Table 5. 



Table 5 



Editable Cell 


Purpose 


Editable Cell 


Purpose 


C4 


Response name 


G4 


AQL 


C5 


Lower specification 
limit 


G5 


KDR 


C6 


Upper specification 
limit 


G6 


K, the critical value 


C11:C2510 


Raw data entry 


G7 






G8 


power 



15 While the workbook can handle response variables with either a one-sided or 

two-sided specification range, only the left-sided specification range is illustrated 
here. 

Example 5 

20 Suppose that the response variable is separation force (ft.lbs.), and the lower 

specification limit is set at 8 ft. lbs. Assume that the conditions of Example 4 apply 
here: 95% power at KDR = 3% , a = 5% at AQL == 1% . The sample size is n = 174 
and the critical value is K = 1 .86% . 

25 Results: Example 5 
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To begin the analysis, enter the response name "Separation Force" into cell 
C4, and the lower specification limit equaling "8" in cell C5. The upper specification 
limit cell 06 is left blank. Then, the hypothesis test information is entered: a "1" in 
cell G4, a "3" in cell G5, a "1.86" in cell G6, a "5" in cell G7, and a "95" in cell G8. 
5 Finally, the data is placed in cells CI 1 :C1 84. It is seen in Figure 1 8 that the QL 
estimate is 0.84% and that the conclusion is to accept the null hypothesis that the 
process defect rate is at AQL. 

The normal curve displayed on the chart uses the sample mean and sample 
standard deviation as its parameters, with the mean being overlaid onto the curve. 
10 The specification limit(s) is(are) highlighted as well. However, it is again stressed 
that the QL estimate is not just the area under the curve beyond the specification 
limit(s): this area would not account for the random variability of either sample 
statistics used to generate the normal curve. This curve does, however, offer a simple 
graphical guide for ancillary purposes. 
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6. Conclusion 

As shown, the QL test procedure for acceptance sampling can improve the 
ANSI standard procedure by allowing a tremendous gain in flexibility regarding the 
specification of the sampling plan parameters. By eliminating the need to use the vast 

20 quantity of ANSI tables and graphs and replacing them instead with an Excel 
workbook, improved construction of sampling plans can be provided. A less 
intimidating, less confusing, and more convenient tool to aid the non-statistically 
oriented user in constructing and analyzing acceptance sampling plans for variables 
data can be provided by the QL test. 

25 In terms of computational issues, the results presently are mixed. It is true that 

the non-central t distribution and its complexity in obtaining tailed areas and critical 
values has been avoided. However, a large percentile grid may need to be stored, and 
the linear interpolation procedures can be rather involved. The QL test procedure 
does have the potential of becoming very computationally efficient. In particular, a 

30 unified model for the sampling distribution of p may be developed. Such a model 

could eliminate the need of any interpolation algorithms and could make the sampling 
plan construction computations closed-formed. 
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It also has been shown that the three (3) Microsoft Excel workbook files can 
facilitate the construction of sampling plans for either attributes data or variables data. 
It also has been shown that the flexibility of the QL test procedure can be a vast 
improvement over sifting through a book of standards to obtain a sampling plan that 

5 may or may not be adequate. Computer resources have made obsolete the "one-size- 
fits-all" philosophy. With the growing trend toward statistical training in industry, the 
notion of using tables and charts for the sole purpose of keeping things parsimonious, 
at the expense of accuracy or correctness, is also outdated. Finally, it is recommended 
that since sampling plans can be constructed with the KDR specified in advance, it 

10 may be in the best interest of the user to determine and incorporate a value for the 
KDR via practical considerations. This action can protect the process from 
undetected drifts into an unacceptably high defect rate region. 

In the drawings and specification, there have been disclosed typical preferred 
embodiments of the invention and, although specific terms are employed, they are 

1 5 used in a generic and descriptive sense only and not for purposes of limitation, the 
scope of the invention being set forth in the following claims. 



